devoted to Concept Theory, Classification, Indexing and Knowledge Representation: Contents
Abstract
The use of Knowledge Organization Systems (KOSs) in aggregated metadata collections facilitates the implementation of search mechanisms operating on the same term or keyphrase space, thus preparing the ground for improved browsing, more accurate retrieval and better user profiling. Automatic thesaurus-based keyphrase extraction appears to be an inexpensive tool to obtain this information, but the studies on its effectiveness are scattered and do not consider the practical applicability of these techniques compared to the quality obtained by involving human experts. This paper presents an evaluation of keyphrase extraction using the KEA software and the AGROVOC vocabulary on a sample of a large collection of metadata in the field of agriculture from the AGRIS database. This effort includes a double evaluation: the classical automatic evaluation based on precision and recall measures, plus a blind evaluation aimed at contrasting the quality of the keyphrases extracted against expert-provided samples and against the keyphrases originally recorded in the metadata. Results show not only that KEA outperforms humans in matching the original keyphrases, but also that the quality of the keyphrases extracted was similar to that of those provided by humans.
Beaudoin, Joan, Ménard, Elaine. Objects of Human Desire: The Organization of Pornographic Videos on Free Websites. Knowledge Organization. 42(2), 90-101. 38 references.
Abstract: Pornographic content is pervasive on the Internet; nevertheless, our knowledge concerning how this content is organized, described, and accessed by individuals is limited. Human sexuality has been a problematic topic within the field of library and information science (LIS). Thus, this study investigates the terminology used to describe pornographic videos. More specifically, this study explores the categories available to access the videos and formulates a framework within which we can begin to address materials of a sexual nature. For the study presented below, data was extracted from 20 free websites to explore the categories used for access, the search mechanisms provided by the sites, and the organizational patterns used for the pornographic video content. This project contributes to an area of research that remains relatively unexplored, and provides useful insights into the organization and terminology surrounding what is inarguably one of the most controversial, and yet ubiquitous, types of material accessible via the Internet.
Hajibayova, Lala, Jacob, Elin K. Factors Influencing User-Generated Vocabularies: How Basic are Basic Level Terms? Knowledge Organization. 42(2), 102-112. 60 references.
Abstract: Studies of user-generated tagging vocabularies (e.g., Yoon 2009) suggest that tag agreement across users is due to widespread use of basic level category terms. This study investigated whether differences in the superordinate, subordinate or basic level of abstraction were influenced by resource content. Analysis of 7617 tags assigned by 40 participants to 36 online resources representing four content categories (i.e., TOOL, FRUIT, CLOTHING, VEHICLE) found significant differences in the frequency of occurrence of subordinate and basic level tags assigned to resources in the FRUIT content category and of superordinate and basic level tags assigned to resources in the CLOTHING content category. This study suggests that variation in the level of abstraction of content-related tags is natural in that perception and understanding arise out of the individual's contextualized experiences of engaging with objects.
Hjørland, Birger. Theories are Knowledge Organizing Systems (KOS). Knowledge Organization. 42(2), 113-128. 100 references.
Abstract: The notion “theory” is a neglected concept in the field of information science and knowledge organization (KO) as well as generally in philosophy and in many other fields, although there are exceptions to this general neglect (e.g., the so-called “theory theory” in cognitive psychology). This article introduces different conceptions of “theory” and argues that a theory is a statement or a conception which is considered open to question and which is connected with background assumptions. Theories form interconnected systems of grand, middle rank and micro theories, and actions, practices and artifacts are theory-laden. The concept of knowledge organization system (KOS) is briefly introduced and discussed. A theory is a form of KOS and theories are the point of departure of any KOS. It is generally understood in KO that concepts are the units of KOSs, but the theory-dependence of concepts brings theories to the forefront in analyzing concepts and KOSs. The study of theories should therefore be given a high priority within KO concerning the construction and evaluation of KOSs.
Similar resources
Image Classification via Sparse Representation and Subspace Alignment
Image representation is a crucial problem in image processing, where there exist many low-level representations of images, e.g., SIFT, HOG and so on. But there is a missing link between low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principal component analysis are employed to d...
Semantic Indexing Approach of a Corpora Based On Ontology
The growth over centuries in the volume of text data such as books and articles in libraries has made it necessary to establish effective mechanisms to locate them. Early techniques such as abstraction, indexing and the use of classification categories marked the birth of a new field of research called "Information Retrieval". Information Retrieval (IR) can be defined as the task of defining models a...
Explaining the Methods of Architecture Representation Using Semiotic Analysis (Umberto Eco's Theory of Architecture Codes)
This paper tries to explain the concept of representation, and of architectural representation in particular, through a qualitative methodology, to trace the procedure by which it is gradually created in architecture, and then, drawing on scholars, to present concepts that show the effect of this concept in the process of architectural facts. In addition, it refers to theories and practical texts b...
Ontology-Based Document Clustering with a Fuzzy Approach
Data mining, also known as knowledge discovery in databases, is the process of discovering unknown knowledge from a large amount of data. Text mining applies data mining techniques to extract knowledge from unstructured text. Text clustering is one of the important techniques of text mining; it is the unsupervised classification of similar documents into different groups. The most important step...
Construction of Knowledge Base for Automatic Indexing and Classification Based on Chinese Library Classification
Class number, descriptor and keyword are three kinds of subject concept identifiers, among which there exist some conceptual mapping relationships, i.e. compatibility. According to this principle, we construct a CLC Knowledge Base on the basis of Chinese Library Classification for automatic indexing and classification. We compare it with the CLC system to illuminate its obvious advantages over...
Deep Unsupervised Domain Adaptation for Image Classification via Low Rank Representation Learning
Domain adaptation is a powerful technique given a large amount of labeled data with similar attributes in different domains. In real-world applications, there is a huge amount of data, but most of it is unlabeled. Domain adaptation is effective in image classification, where it is expensive and time-consuming to obtain adequate labeled data. We propose a novel method named DALRRL, which consists of deep ...